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Description 

This invention relates to the field of receptors belonging to the superfamily of nuclear hormone receptors, in par- 
ticular to steroid receptors. The invention relates to DNA encoding a novel steroid receptor, the preparation of said 
5 receptor, the receptor protein, and the uses thereof. 

Steroid hormone receptors belong to a superfamily of nuclear hormone receptors involved in ligand-dependent 
transcriptional control of gene expression. In addition, this superfamily consists of receptors for non-steroid hormones 
such as vitamine D, thyroid hormones and retinoids (Giguere et al, Nature 330, 624-629, 1987; Evans, R.M., Science 
240, 889-895,1 988). Moreover, a range of nuclear receptor-like sequences have been identified which encode socalled 
10 'orphan' receptors: these receptors are structurally related to and therefore classified as nuclear receptors, although 
no putative ligands have been identified yet (B.W. O'Malley, Endocrinology 125, 1119-1170, 1989; D.J. Mangelsdorf 
and R.M. Evans, Cell, 83, 841-850, 1995). 

The superfamily of nuclear hormone receptors share a modular structure in which six distinct structural and func- 
tional domains, A to F, are displayed (Evans, Science 240, 889-895, 1 988). A nuclear hormone receptor is characterized 
'5 by a variabel N-terminal region (domain A/B), followed by a centrally located, highly conserved DNA-binding domain 
(hereinafter referred to as DBD; domain C), a variable hinge region (domain D), a conserved ligand-binding domain 
(herein after referred to as LBD; domain E) and a variable C-terminal region (domain F). 

The N-terminal region, which is highly variable in size and sequence, is poorly conserved among the different 
members of the superfamily This part of the receptor is involved in the modulation of transcription activation (Bocquel 
20 et al, Nucl. Acid Res., 17, 2581-2595, 1989; Tora et al, Cell 59, 477-487, 1989). 

The DBD consists of approximately 66 to 70 amino acids and is responsible for DNA-binding activity: it targets the 
receptor to specific DNA sequences called hormone responsive elements (hereinafter referred to as HRE) within the 
transcription control unit of specific target genes on the chromatin (Martinez and Wahli, In 'Nuclear Hormone Receptors', 
Acad. Press, 125-153, 1991). 

25 The LBD is located in the C-terminal part of the receptor and is primarily responsible for ligand binding activity. In 

this way, the LBD is essential for recognition and binding of the hormone ligand and, in addition possesses a transcrip- 
tion activation function, thereby determining the specificity and selectivity of the hormone response of the receptor 
Although moderately conserved in structure, the LBD's are known to vary considerably in homology between the indi- 
vidual members of the nuclear hormone receptor superfamily (Evans, Science 240, 889-895, 1 988; PJ. Fuller, FASEB 

30 j. t 5, 3092-3099, 1991; Mangelsdorf et al, Cell, Vol. 83, 835-839, 1995). 

Functions present in the N-terminal region, LBD and DBD operate independently from each other and it has been 
shown that these domains can be exchanged between nuclear receptors (Green et al, Nature, Vol 325, 75-78, 1 987). 
This results in chimeric nuclear receptors, such as described for instance in WO-A-8905355. 

When a hormone ligand for a nuclear receptor enters the cell by diffusion and is recognized by the LBD, it will bind 

35 to the specific receptor protein, thereby initiating an allosteric alteration of the receptor protein. As a result of this 
alteration the ligand/receptor complex switches to a transcriptionally active state and as such is able to bind through 
the presence of the DBD with high affinity to the corresponding HRE on the chromatin DNA (Martinez and Wahli, 
'Nuclear Hormone Receptors',125-153, Acad. Press, 1991). In this way the ligand/receptor complex modulates ex- 
pression of the specific target genes. The diversity achieved by this family of receptors results from their ability to 

40 respond to different ligands. 

The steroid hormone receptors are a distinct class of the nuclear receptor superfamily, characterized in that the 
ligands are steroid hormones. The receptors for glucocorticoids (GR), mineralcorticoids (MR), progestins (PR), andro- 
gens (AR) and estrogens (ER) are classical steroid receptors. Furthermore, the steroid receptors have the unique 
ability upon activation to bind to palindromic DNA sequences, the so-called HRE's, as homodimers. The GR, MR, PR 

45 and AR recognize the same DNA sequence, while the ER recognizes a different DNA sequence. (Beato et al, Cell, 
Vol. 83, 851 -857, 1 995). After binding to DNA, the steroid receptor is thought to interact with components of the basal 
transcriptional machinery and with sequence-specific transcription factors, thus modulating the expression of specific 
target genes. 

Several HRE's have been identified, which are responsive to the hormone/receptor complex. These HRE's are 
so situated in the transcriptional control units of the various target genes such as mammalian growth hormone genes 
(responsive to glucocorticoid, estrogen, testosterone), mammalian prolactin genes and progesterone receptor genes 
(responsive to Estrogen), avian ovalbumin genes (responsive to progesterone), mammalian metallothionein gene (re- 
sponsive to glucocorticoid) and mammalian hepatic a^-globulin gene (responsive to estrogen, testosterone, gluco- 
corticoid). 

55 The steroid hormone receptors have been known to be involved in embryonic development, adult homeostasis as 

well as organ physiology. Various diseases and abnormalities have been ascribed to a disturbance in the steroid hor- 
mone pathway. Since the steroid receptors exercise their influence as hormone-activated transcriptional modulators, 
it can be anticipated that mutations and defects in these receptors, as well as overstimulation or blocking of these 
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receptors might be the underlying reason for the altered pattern. A better knowledge of these receptors, their mecha- 
nism of action and of the ligands which bind to said receptor might help to create a better insight in the underlying 
mechanism of the hormone signal transduction pathway, which eventually will lead to better treatment of the diseases 
and abnormalities linked to altered hormone/receptor functioning. 

s For this reason cDNA's of the steroid and several other nuclear receptors of several mammalians, including hu- 

mans, have been isolated and the corresponding amino acid sequences have been deduced, such as for example the 
human steroid receptors PR, ER, GR, MR, and AR, the human non-steroid receptors for vitamine D, thyroid hormones, 
and retinoids such as retinol A and retinoic acid. In addition, cDNA's encoding well over 100 mammalian orphan re- 
ceptors have been isolated, for which no putative ligands are known yet (Mangelsdorf et al, Cell, \fo1.83, 835-839, 

10 1 995). However, there is still a great need for the elucidation of other nuclear receptors in order to unravel the various 
roles these receptors play in normal physiology and pathology. 

The present invention provides for such a novel nuclear receptor. More specific, the present invention provides 
1or novel steroid receptors, having estrogen mediated activity. Said novel steroid receptors are novel estrogen recep- 
tors, which are able to bind and be activated by, for example, estradiol, estrone and estriol. 

is According to the present invention it has been found that a novel estrogen receptor is expressed as an 8 kb 

transcript in human thymus, spleen, peripheral blood lymphocytes (PBLs), ovary and testis. Furthermore, additional 
transcripts have been identified. Another transcript of approximately 10 kb was identified in ovary, thymus and spleen. 
In testis, an additional transcript of 1 .3 kb was detected. These transcripts are probably generated by alternative splicing 
of the gene encoding the novel estrogen receptor according to the invention. 

20 Cloning of the cDNA's encoding the novel estrogen receptors according to the invention revealed that several 

splicing variants of said receptor can be distinguished. At the protein level, these variants differ only at the C-terminal 
part. 

cDNA encoding an ER has been isolated (Green, et al, Nature 320, 134-139, 1986; Greene et al, Science 231, 
1150-1154, 1986), and the corresponding amino acid sequence has been deduced. This receptor and the receptor 

25 according to the present invention, however, are distinct, and encoded for by different genes with different nucleic acid 
sequences. Not only do the ER of the prior art (hereinafter referred to as classical ER) and the ER according to the 
present invention differ in amino acid sequence, they also are located on different chromosomes. The gene encoding 
the classical ER is located on chromosome 6, whereas the gene encoding the ER according to the invention was found 
to be located on chromosome 1 4. The ER according to the invention furthermore distinguishes itself from the classical 

30 receptor in differences in tissue distribution , indicating that there may be important differences between these receptors 
at the level of estrogenic signalling. 

In addition, two orphan receptors, ERRa and ERRp, having an estrogen receptor related structure have been 
described (Giguere et al, Nature 331, 91-94, 1988). These orphan receptors, however, have not been reported to be 
able to bind estrodial or any other hormone that binds to the classical ER, and other ligands which bind to these 

35 receptors have not been found yet. The novel estrogen receptor according to the invention distinguishes itself clearly 
from these receptors since it was found to bind estrogens. 

The fact that a novel ER according to the invention has been found is all the more surprising, since any suggestion 
towards the existence of additional estrogen receptors was absent in the scientific literature: neither the isolation of 
the classical ER nor the orphan receptors ERRa and ERRp suggested or hinted towards the presence of additional 

40 estrogen receptors such as the receptors according to the invention. The identification of additional ER's could be a 
major step forward for the existing clinical therapies, which are based on the existence of one ER and as such ascribe 
all estrogen mediated abnormalities and/or diseases to this one receptor. The receptors according to the invention will 
be useful in the development of hormone analogs that selectively activate either the classical ER or the novel estrogen 
receptor according to the invention. This should be considered as one of the major advantages of the present invention. 

45 Thus, in one aspect, the present invention provides for isolated cDNA encoding a novel steroid receptor In par- 

ticular, the present invention provides for isolated cDNA encoding a novel estrogen receptor. 

According to this aspect of the present invention, there is provided an isolated DNA encoding a steroid receptor 
protein having an N-terminal domain, a DNA-binding domain and a ligand-binding domain, wherein the amino acid 
sequence of said DNA-binding domain of said receptor protein exhibits at least 80% homology with the amino acid 

so sequence shown in SEQ ID NO:3, and the amino acid sequence of said ligand-binding domain of said receptor protein 
exhibits at least 70% homology with the amino acid sequence shown in SEQ ID NO:4. 

In particular, the isolated DNA encodes a steroid receptor protein having an N-terminal domain, a DNA-binding 
domain and a ligand-binding domain, wherein the amino acid sequence of said DNA-binding domain of said receptor 
protein exhibits at least 90%, preferably 95%, more preferably 98%, most preferably 100% homology with the amino 

55 acid sequence shown in SEQ ID NO:3. 

More particularly, the isolated DNA encodes a steroid receptor protein having an N-terminal domain, a DNA-binding 
domain and a ligand-binding domain , wherein the amino acid sequence of said ligand-binding domain of said receptor 
protein exhibits at least 75%, preferably 80%, more preferably 90%, most preferably 100% homology with the amino 
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acid sequence shown in SEQ ID NO:4. 

A preferred isolated DNA according to the invention encodes a steroid receptor protein having the amino acid 
sequence shown in SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:21 or SEQ ID NO:25. 

A more preferred isolated DNA according to the invention is an isolated DNA comprising a nucleotide sequence 
5 shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:20 or SEQ ID NO:24. 

The DNA according to the invention may be obtained from cDNA. Alternatively, the coding sequence might be 
genomic DNA, or prepared using DNA synthesis techniques. 

The DNA according to the invention will be very useful for in vivo expression of the novel receptor proteins according 
to the invention in sufficient quantities and in substantially pure form. 
jo in another aspect of the invention, there is provided for a steroid receptor comprising the amino acid sequence 

encoded by the above described DNA molecules. 

The steroid receptor according to the invention has an N-terminal domain, a DNA-binding domain and a ligand- 
binding domain, wherein the amino acid sequence of said DNA-binding domain of said receptor exhibits at least 80% 
homology with the amino acid sequence shown in SEQ ID NO:3, and the amino acid sequence of said ligand-binding 
is domain of said receptor exhibits at least 70% homology with the amino acid sequence shown in SEQ ID NO:4. 

In particular, the steroid receptor according to the invention has an N-terminal domain, a DNA-binding domain and 
a ligand-binding domain, wherein the amino acid sequence of said DNA-binding domain of said receptor exhibits at 
least 90%, preferably 95%, more preferably 98%, most preferably 100% homology with the amino acid sequence 
shown in SEQ ID NO:3. 

20 More particular, the steroid receptor according to the invention has an N-terminal domain, a DNA-binding domain 

and a ligand-binding domain, wherein the amino acid sequence of said ligand-binding domain of said receptor exhibits 
at least 75%, prefearbly 80%, more preferably 90%, most preferably 100% homology with the amino acid sequence 
shown in SEQ ID NO:4. 

It will be clear for those skilled in the art that also steroid receptor proteins comprising combined DBD and LBD 
25 preferences and DNA encoding such receptors are subject of the invention. 

Preferably, the steroid receptor according to the invention comprises an amino acid sequence shown in SEQ ID 
NO:5, SEQ ID NO:6, SEQ ID NO:21 or SEQ ID NO:25. 

Also within the scope of the present invention are steroid receptor proteins which comprise variations in the amino 
acid sequence of the DBD and LBD without loosing their respective DNA-binding or ligand-binding activities. The 
30 variations that can occur in those amino acid sequences comprise deletions, substitutions, insertions, inversions or 
additions of (an) amino acid(s) in said sequence, said variations resulting in amino acid difference(s) in the overall 
sequence. It is well known in the art of proteins and peptides that these amino acid differences lead to amino acid 
sequences that are different from, but still homologous with the native amino acid sequence they have been derived 
from. 

35 Amino acid substitutions that are expected not to essentially alter biological and immunological activities, have 

been described in for example Dayhof, M.D., Atlas of protein sequence and structure, Nat. Biomed. Res. Found., 
Washington D.C. , 1 978, vol. 5, suppl. 3. Amino acid replacements between related amino acids or replacements which 
have occurred frequently in evolution are, inter alia Ser/Ala, Ser/Gly, Asp/Gly, Arg/Lys, Asp/Asn, lle/Val. Based on this 
information Lipman and Pearson developed a method for rapid and sensitive protein comparison (Science 227, 

40 1435-1441 , 1 985) and determining the functional similarity between homologous polypeptides. 

Variations in amino acid sequence of the DBD according to the invention resulting in an amino acid sequence that 
has at least 80% homology with the sequnece of SEQ ID NO:3 will lead to receptors still having sufficient DNA binding 
activity. Variations in amino acid sequence of the LBD according to the invention resulting in an amino acid sequence 
that has at least 70% homology with the sequnece of SEQ ID NO:4 will lead to receptors still having sufficient ligand 

45 binding activity. 

Homology as defined herein is expressed in percentages, determined via PCGENE. Homology is calculated as 
the percentage of identical residues in an alignment with the sequence according to the invention. Gaps are allowed 
to obtain maximum alignment. 

Comparing the amino acid sequences of the classical ER and the ER's according to the invention revealed a high 

50 degree of similarity within their respective DBD's. The conservation of the P-box (amino acids E-G-X-X-A) which is 
responsible for the actual interactions of the classical ER with the target DNA element (Zilliacus et al. t MoI.Endo. 9, 
389, 1995; Glass, End.Rev. 15, 391, 1994), is indicative for a recognition of estrogen responsive elements (ERE's) by 
the ER's according to the invention. The receptors according to the invention indeed showed ligand-dependent trans- 
activation on ERE-containing reporter constructs. Therefore, the classical ER and the novel ER's according to the 

55 invention may have overlapping target gene specificities. This could indicate that in tissues which co-express both 
respective ER's, these receptors compete for ERE's. The ER's according to the invention may regulate transcription 
of target genes differently from classical ER regulation or could simply block classical ER functioning by occupying 
estrogen responsive elements. Alternatively, transcription might be influenced by heterodimerization of the different 
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receptors. 

Thus, a preferred steroid receptor according to the invention comprises the amino acid sequence E-G-X-X-A within 
the P box of the DNA binding domain, wherein X stands for any amino acid. Also within the scope of the invention is 
isolated DNA encoding such a receptor. 
s Methods to prepare the receptors according to the invention are well known in the art (Sambrook et al., Molecular 

Cloning: a Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1 989). The most practical 
approach is to produce these receptors by expression of the DNA encoding the desired protein. 

A wide variety of host cell and cloning vehicle combinations may be usefully employed in cloning the nucleic acid 
sequence coding for the receptor of the invention. For example, useful cloning vehicles may include chromosomal, 
10 non -chromosomal and synthetic DNA sequences such as various known bacterial plasmids and wider host range 
plasmids and vectors derived from combinations of plasmids and phage or virus DNA. Useful hosts may include bac- 
terial hosts, yeasts and other fungi, plant or animal hosts, such as Chinese Hamster Ovary (CHO) cells or monkey 
cells and other hosts. 

Vehicles for use in expression of the ligand-binding domain of the present invention will further comprise control 

15 sequences operably linked to the nucleic acid sequence coding for the ligand-binding domain. Such control sequences 
generally comprise a promoter sequence and sequences which regulate and/or enhance expression levels. Further- 
more an origin of replication and/or a dominant selection marker are often present in such vehicles. Of course control 
and other sequences can vary depending on the host cell selected. 

Techniques for transforming or transfecting host cells are quite known in the art (see, for instance, Sambrook et 

20 al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 1989). 

Recombinant expression vectors comprising the DNA of the invention as well as cells transformed with said DNA 
or said expression vector also form part of the present invention. 

In a further aspect of the invention, there is provided for a chimeric receptor protein having an N-terminal domain, 
a DNA-binding domain, and a ligand-binding domain, characterized in that at least one of the domains originates from 

25 a receptor protein according to the invention, and at least one of the other domains of said chimeric protein originates 
from another receptor protein from the nuclear receptor superfamily, provided that the DNA-binding domain and the 
ligand-binding domain of said chimeric receptor protein originate from different proteins. 

In particular, the chimeric receptor according to the invention comprises the LBD according to the invention, said 
LBD having an amino acid sequence which exhibits at least 70% homology with the amino acid sequence shown in 

30 SEQ ID NO:4. In that case the N-terminal domain and DBD should be derived from another nuclear receptor, such as 
for example PR. In this way a chimeric receptor is constructed which is activated by a ligand of the ER according to 
the invention and which targets a gene under control of a progesterone responsive element The chimeric receptors 
having a LBD according to the invention are useful for the screening of compounds to identify novel ligands or hormone 
analogs which are able to activate an ER according to the invention. 

35 In addition, chimeric receptors comprising a DBD according to the invention, said DBD having an amino acid 

sequence exhibiting at least 80% homology with the amino acid sequence shown in SEQ ID NO:3, and a LBD and, 
optionally, an N-terminal domain derived from another nuclear receptor, can be succesf ully used to identify novel ligands 
or hormone analogs for said nuclear receptors. Such chimeric receptors are especially useful for the identification of 
the respective ligands of orphan receptors. 

40 Since steroid receptors have three domains with different functions, which are more or less independent, it is 

possible that all three functional domains have been derived from different members of the steroid receptor superfamily. 

Molecules which contain parts having a different origin are called chimeric. Such a chimeric receptor comprising 
the ligand-binding domain and/or the DNA-binding domain of the invention may be produced by chemical linkage, but 
most preferably the coupling is accomplished at the DNA level with standard molecular biological methods by fusing 

45 the nucleic acid sequences encoding the necessary steroid receptor domains. Hence, DNA encoding the chimeric 
receptor proteins according to the invention are also subject of the present invention. 

Such chimeric proteins can be prepared by transfecting DHA encoding these chimeric receptor proteins to suitable 
host cells and culturing these cells under suitable conditions. 

It is extremely practical if, next to the information for the expression of the steroid receptor, also the host cell is 

50 transformed or transfected with a vector which carries the information for a reporter molecule. Such a vector coding 
for a reporter molecule is characterized by having a promoter sequence containing one or more hormone responsive 
elements (HRE) functionally linked to an operative reporter gene. Such a HRE is the DNA target of the activated steroid 
receptor and, as a consequence, it enhances the transcription of the DNA, coding for the reporter molecule. In in vivo 
settings of steroid receptors the reporter molecule comprises the cellular response to the stimulation of the ligand. 

55 However, it is possible in vitro to combine the ligand-binding domain of a receptor to the DNA binding domain and 
transcription activating domain of other steroid receptors, thereby enabling the use of other HRE and reporter molecule 
systems. One such a system is established by a HRE presented in the MMTV-LTR (mouse mammary tumor virus long 
terminal repeat sequence in connection with a reporter molecule like the firefly lucif erase gene or the bacterial gene 



5 



1 



EP 0 798 378 A2 

for CAT (chloramphenicol transferase). Other HRE's which can be used are the rat oxytocin promotor, the retinoic acid 
responsive element, the thyroid hormone responsive element, the estrogen responsive element and also synthetic 
responsive elements have been described (for instance in Fuller, ibid, page 3096). As reporter molecules next to CAT 
and luciferase p-gatactosidase can be used. 

5 Steroid hormone receptors and chimeric receptors according to the present invention can be used for the in vitro 

identification of novel ligands or hormonal analogs. For this purpose binding studies can be performed with cells trans- 
formed with DNA according to the invention or an expression vector comprising DNA according to the invention, said 
cells expressing the steroid receptors or chimeric receptors according to the invention. 

The novel steroid hormone receptor and chimeric receptors according to the invention as well as the ligand-binding 

10 domain of the invention, can be used in an assay for the identification of functional ligands or hormone analogs for the 
nuclear receptors. 

Thus, the present invention provides for a method for identifying functional ligands for the steroid receptors and 
chimeric receptors according to the invention, said method comprising the steps of 

is a) introducing into a suitable host cell 1 ) DNA or an expression vector according to the invention, and 2) a suitable 

reporter gene functionally linked to an operative hormone response element, said HRE being able to be activated 
by the DNA-binding domain of the receptor protein encoded by said DNA; 

b) bringing the host cell from step a) into contact with potential ligands which will possibly bind to the ligand-binding 
domain of the receptor protein encoded by said DNA from step a); 
20 c) monitoring the expression of the receptor protein encoded by said reporter gene of step a). 

If expression of the reporter gene is induced with respect to basic expression (without ligand), the functional ligand 
can be considered as an agonist; if expression of the reporter gene remains unchanged or is reduced with respect to 
basic expression, the functional ligand can be a suitable (partial) antagonist. 

25 For performing such kind of investigations host cells which have been transformed or transfected with both a vector 

encoding a functional steroid receptor and a vector having the information for a hormone responsive element and a 
connected reporter molecule are cultured in a suitable medium. After addition of a suitable ligand, which will activate 
the receptor the production of the reporter molecule will be enhanced, which production simply can be determined by 
assays having a sensitivity for the reporter molecule. See for instance WO-A-88031 68. Assays with known steroid 

so receptors have been described (for instance S. Tsai et al. t Cell 57, 443, 1989; M. Meyer et al., Cell 57, 433, 1989). 

Legends to the figures 
Figure 1. 

35 

Northern analysis of the novel estrogen receptor (ERP). Two different multiple tissue Northern blots (Clontech) 
were hybridised with a specific probe for ERp (see examples). Indicated are the human tissues the RNA originated 
from and the position of the size markers in kilobases (kb). 

40 Figure 2. 

Histogram showing the 3- to 4-fold stimulatory effect of 1 7p-estradiol, estriol and estrone on the luciferase activity 
jnediated by Enp. An expression vector encoding ERp was transiently transfected into CHO cells together with a re- 
porter construct containing the rat oxytocin promoter in front of the firefly luciferase encoding sequence (see examples). 

45 

Figure 3. 

Effect of 17p-estradiol (E2) alone or in combination with the anti-estrogen ICI-164384 (ICI) on ERot and ERp. 
Expression constructs for ERct (the classical ER) and ERp were transiently transfected into CHO cells together with 
so the rat oxytocin promoter-lucif erase reporter construct described in the examples. Luciferase activities were determined 
in triplicate and normalised for transfection efficiency by measuring p-galactosidase in the same lysate. 

Figure 4. 

55 Expression of ERa and ERp in a number of cell lines determined by RT-PCR analysis (see examples). The cell 

lines used were derived from different tissues/cell types: endometrium (ECC1, Ishikawa, HEC-1A, RL95-2); osteosa- 
rcoma (SAOS-2, U2-OS, HOS, MG63); breast tumours (MCF-7, T47D), endothelium (HUV-EC-C, BAEC-1); smooth 
muscle (HISM, PAC-1, A7R5, A10, RASMC, CavaSMC); liver (HepG2); colon (CaCo2); and vagina (HS-760T, SW-954). 
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All cell lines were human except for PAC-1 , A7R5, A10 and RASMC which are of rat origin, BAEC-1 which is of 
bovine origin and CavaSMC which is of guinea pig origin. 

Figure 5. 

5 

Transactivation assay using stably transfected CHOcell lines expressing ERaor ERp together with the rat oxytocin- 
lucif erase estrogen-responsive reporter (see examples for details). Hormone-dependent transactivation curves were 
determined for 17p-estradiol and for Org4094. For the ER antagonist raloxifen, cells were treated with 2 x 1CT 10 mol/ 
L 17(i-estradiol together with increasing concentrations of raloxifen. Maximal values of the responses were arbitrarily 
10 set at 100%. 

Examples 

A Molecular cloning of the novel estrogen receptor. 

15 

Two degenerate oligonucleotides containing inosines (!) were based on conserved regions of the DNA-binding 
domains and the ligand-binding domains of the human steroid hormone receptors. 

Primer #1: 

20 

5' -GGIGA<C/T) GA (A/G) GC (A/T) TCIGGITG (C/T) CA (C/T) TA(C/T) GG-3' 
(SEQ ID NO: 7) . 

25 

Primer #2: 

5'-AAGCCTGG(C/G)A(C/T) IC(G/T) (C/T) TTIGCCCAI (C/T) TIAT-3' SEQ 
30 ID NO: 8) . 

As template, cDNAfrom human EBV-stimulated PBLs (peripheral blood leukocytes) was used. One microgram of 
total RNA was reverse transcribed in a 20 u.l reaction containing 50 mM KCI, 10 mM Tris-HCI pH 8.3, 4 mM MgCI2, 1 

35 mM dNTPs (Pharmacia), 100 pmol random hexanucleotides (Pharmacia), 30 Units RNAse inhibitor (Pharmacia) and 
200 Units Ivl-MLV Reverse transcriptase (Gibco BRL). Reaction mixtures were incubated at 37°C for 30 minutes and 
heat-inactivated at 100°C for 5 minutes. The cDNA obtained was used in a 100 uJ PCR reaction containing 10 mM 
Tris-HCI pH 8.3, 50 mM KCI, 1.5 mM MgCI2, 0.001% gelatin (w/v), 3% DMSO, 1 microgram of primer #1 and primer 
#2 and 2.5 Units of Amplitaq DNA polymerase (Perkin Elmer). PCR reactions were performed in the Perkin Elmer 9600 

40 thermal cycler The initial denaturation (4 minutes at 94°C) was followed by 35 cycles with the following conditions: 30 
sec. 94°C, 30 sec. 45°C, 1 minute 72°C and after 7 minutes at 72°C the reactions were stored at 4°C. Aliquots of these 
reactions were analysed on a 1 .5% agarose gel. Fragments of interest were cut out of the gel, reamplif ied using identical 
PCR-conditions and purified using Qiaex II (Qiagen). Fragments were cloned in the pCRII vector and transformed into 
bacteria using the TA-cloning kit (Invitrogen). Plasmid DNA was isolated for nucleotide sequence analysis using the 

45 Qiagen, plasmid midi protocol (Qiagen). Nucleotide sequence analysis was performed with the ALF automatic se- 
quencer (Pharmacia) using a T7 DNA sequencing kit (Pharmacia) with vector-specific or fragment-specific primers. 

One cloned fragment corresponded to a novel estrogen receptor (ER) which is closely related to the classical 
estrogen receptor. Part of the cloned novel estrogen receptor fragment (nucleotides 466 to 797 in SEQ ID 1) was 
amplified by PCR using oligonucleotide #3 TGTTACGAAGTGGGAATGGTGA (SEQ ID NO:9) and oligonucleotide #2 

so and used as a probe to screen a human testis cDN A library in ^gt1 1 (Clontech #HL1 01 0b). Recombinant phages were 
plated (using Y1 090 bacteria grown in LB medium supplemented with 0.2% maltose) at a density of 40.000 pfu (plaque- 
forming units) per 135 mm dish and replica filters (Hybond-N. Amersham) were made as described by the supplier. 
Filters were prehybridised in a solution containing 0.5 M phosphate buffer (pH 7.5) and 7% SDS at 65°C for at least 
30 minutes. DNA probes were purified with Qiaex II (Qiagen), ^P-labeled with a Decaprime kit (Ambion) and added 

55 to the prehybridisation solution. Filters were hybridised at 65°C overnight and then washed in 0.5 X SSC/0,1% SDS 
at 65°C. Two positive plaques were identified and could be shown to be identical. These clones were purified by 
rescreening one more time. A PCR reaction on the phage eluates with the Xgt11 -specific primers #4: 5'-TTGACAC- 
CAGACCAACTGGTAATG-3 1 (SEQ ID NO: 10) and #5: S'-GGTGGCGACGACTCCTGGAGCCCG-S' (SEQ ID NO:11) 
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yielded a fragment of 1700 basepairs on both clones. 

Subsequent PCR reactions using combinations of a gene-specific primer #6: 5'-GTACACTGATTTGTAGCTGGAC-3' 
(SEQ ID NO: 12) with the Xgtll primer #4 and gene-specific primer #7: 5'-CCATGATGATGTCCCTGACC-3' (SEQ ID 
NO: 13) with Xgt11 primer primer #5 yielded fragments of approximately 450 bp and 1000 bp, respectively, which were 

s cloned in the pCRII vector and used for nucleotide sequence analysis. The conditions for these PCR reactions were 
as described above except for the primer concentrations (200 ng of each primer) and the annealing temperature (60°C). 
Since in the cDNA clone the homology with the ER is lost abruptly at a site which corresponds to the exon 7/exon 8 
boundary in the ER (between nucleotides 1247 and 1248 in SEQ ID NO:1), it was suggested that this sequence cor- 
responds to intron 7 of the novel ER gene. For verification of the nucleotide sequences of this cDNA clone, a 1200 bp 

10 fragment was generated on the cDNA clone with >.gt11 primer #4 with a gene-specific primer #8 corresponding to the 
3' end of exon 7: 5'-TCGCATGCCTGACGTGGGAC-3 , (SEQ ID NO: 14) using the proofreading Pfu polymerase (Strat- 
agene). This fragment was also cloned in the pCRII vector and completely sequenced and was shown to be identical 
to the sequences obtained earlier. 

To obtain nucleotide sequences of the novel ER downstream of exon 7, a degenerate oligonucleotide based on 

is the AF-2 region of the classical ER (#9: 5 , -GGC(C/G)TCCAGCATCTCCAG(C/G)A(A/G)GAG-3 , ; SEQ ID NO:15) was 
used together with the gene-specific oligonucleotide #10: 5'-GGAAGCTGGCTCACTTGCTG-3' (SEQ ID NO:16) using 
testis cDNA as template (Marathon ready testis cDNA, Clontech Cat #741 4-1 ). A specific 220 bp fragment correspond- 
ing to nucleotides 1112 to 1332 in SEQ ID No. 1 was cloned and sequenced. Nucleotides 11 12 to 1247 were identical 
to the corresponding sequence of the cDNA clone. The sequence downstream thereof is highly homologous with the 

20 corresponding region in the classical ER. In order to obtain sequences of the novel ER downstream of the AF-2 region, 
RACE (rapid amplification of cDNA ends) PCR reactions were performed using the Marathon-ready testis cDNA (Clon- 
tech) as template. The initial PCR was performed using oligonucleotide #11 : 5'-TCTTGTTCTGGACAGGG ATG-3' (SEQ 
ID NO: 17) in combination with the AP1 primer provided in the kit. A nested PCR was performed on an aliquot of this 
reaction using oligonucleotide #10 (SEQ ID NO: 16) in combination with the oligo dT primer provided in the kit. Subse- 
ts quently, an aliquot of this reaction was used in a nested PCR using oligonucleotide #12: 5'-GCATGGAACATCTGCT- 
CAAC-3' (SEQ ID NO: 18) in combination with the oligo dT primer. Nucleotide sequence analysis of a specific fragment 
that was obtained (corresponding to nucleotides 1256 to 1431 in SEQ ID NO 1) revealed a sequence encoding the 
carboxyterminus of the novel ER ligand-binding domain, including an F-domain and a translational stop codon and 
part of the 3' untranslated sequence which is not included in SEQ ID NO:1. The deduced amino acid sequence is 

30 shown in SEQ ID NO:5. 

In order to investigate the possibility that the novel estrogen receptor had additional, upstream translation-initiation 
codons, RACE-PCR experiments were performed using Marathon-ready testis cDNA (Clontech Cat. # 7414-1). First 
a PCR was performed using oligonucleotide SEQ ID NO:12 (antisense corresponding to nucleotides 416-395 in SEQ 
ID NO:1) and AP-1 (provided in the kit). A nested PCR was then performed using oligonucleotide having SEQ ID NO: 

35 27 (antisense corresponding to nucleotides 254-231 in SEQ ID NO:1 ) with AP-2 (provided in the kit). From the smear 
that was obtained, the region corresponding to fragments larger than 300 basepairs was cut out, purified using the 
Genecleanll kit (Bio101) and cloned using the TA-cloning kit (Clontech). Colonies were screened by PCR using gene- 
specific primers: SEQ ID NO:22 and SEQ ID NO:28. The clone containing the largest insert was sequenced. The 
nucleotide sequence corresponds to nucleotides 1 to 490 in SEQ ID NO:24. It is clear from this sequence that the first 

40 in-frame upstream translation initiation codon is present at position 77-79 in SEQ ID NO:24. Upstream of this transla- 
tional startcodon an in-frame stop-codon is present (11-13 in SEQ ID NO:24). Consequently, the reading frame of the 
novel estrogen receptor is 530 amino acids (shown in SEQ ID NO:25) and has a calculated molecular mass of 59.234 
J<D. 

To confirm the nucleotide sequences obtained by 5' RACE, human genomic clones were obtained and analysed. 

45 A human genomic library in >£MBL3 (Clontech HL1067J) was screened with a probe corresponding to nucleotides 1 
to 41 6 in SEQ I D NO: 1 . A strongly hybridizing clone was plaque-purified and DN A was isolated using standard protocols 
(Sambrook et al, 1 989). The DNA was digested with several restriction enzymes, electrophoresed on agarose gel and 
blotted onto Nylon filters. Hybridisation of the blot with a probe corresponding to the above-mentioned RACE fragment 
(nucleotides 1-490 in SEQ ID NO:24) revealed a hybridizing Sau3A fragment of approximately 800 basepairs. This 

50 fragment was cloned into the BamH1 site of pGEM3Z and sequenced. The nucleotide sequence contained one base 
difference which is probably a PCR-induced point mutation in the RACE fragment. Nucleotide 172 was a G residue in 
the 5'RACE fragment, but an A residue in several independent genomic subclones. 

B. Identification of two splice variants of the novel estrogen receptor. 

55 

Rescreening of the testis cDNA library with a probe corresponding to nucleotides 918 to 1246 in SEQ ID No. 1 
yielded two hybridizing clones, the 3' end of which were amplified by PCR (gene-specific primer #10: 5'-GGAAGCT- 
GGCTCACTTGCTG-3' (SEQ ID NO:16) together with primer #4, SEQ ID NO:10), cloned and sequenced. One clone 
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was shown to contain an alternative exon 8 (exon 8B) of the novel ER. In SEQ ID No. 2 the protein encoding part and 
the stopcodon of this splice variant are presented. As a consequence of the introduction of this exon through an alter- 
native splicing reaction, the reading frame encoding the novel ER is immediately terminated, thereby creating a trun- 
cation of the carboxyterminus of the novel ER (SEQ ID NO:6). 

Screening of a human thymus cDNA library (Clontech HL1074a) with the probe corresponding to nucleotides 918 
to 1246 in SEQ ID No. 1, revealed another splice variant. The 3' end of one hybridizing clone was amplified using 
primer #10 (SEQ ID NO: 16) with the Xgt10-specific primer #1 3 5'-AGCAAGTTCAGCCTGTTAAGT-3' (SEQ ID NO: 19), 
cloned and sequenced. The obtained nucleotide sequence upstream of the exon 7/exon 8 boundary was identical to 
the clones identified earlier. However, an alternative exon 8 (exon 8C) was present at the 3' end encoding two C- 
terminal amino acids followed by a stop-codon. The nucleotide sequence of the protein-encoding part of this splice 
variant is shown in SEQ ID NO;20, the corresponding protein sequence is SEQ ID NO:21. 

These two variants of the novel estrogen receptor do not contain the AF-2 region and therefore probably lack the 
ability to modulate transcription of target genes in a ligand-dependent fashion. However, the variants potentially could 
interfere with the functioning of the wild-type classical ER and/or the wild-type novel ER, either by heterodimerization 
or by occupying estrogen response elements or by interactions with other transcription factors. A mutant of the classical 
ER (ER1-530) has been described which closely resembles the two variants of the novel estrogen receptor described 
above. ER1-530 has been shown to behave as a dominant-negative receptor i.e. it can modulate the intracellular 
activity of the wild type ER (Ince et al, J. Biol. Chem. 268, 14026-14032. 1993). 

C. Northern blot analysis. 

Human multiple tissue Northern blots (MTN-blots) were purchased from Clontech and prehybridized for at least 1 
hour at 65°C in 0.5 M phosphate buffer pH 7.5 with 7% SDS. The DNA fragment that was used as a probe (corre- 
sponding to nucleotides 466 to 797 in SEQ ID No. 1) was 32 P-labeled using a labelling kit (Ambion), denatured by 
boiling and added to the prehybridisation solution. Washing conditions were: 3X SSC at room temperature, followed 
by 3 X SSC at 65°C, and finally 1 X SSC at 65°C. The filters were than exposed to X-ray films for one week. Two 
transcripts of approximately 8 kb and 10 kb were detected in thymus, spleen, ovary and testis. In addition, a 1.3 kb 
transcript was detected in testis. 

D. RT-PCR analysis of expression of ERa and ER$ in cell lines. 

RN A was isolated from a number of human and animal cell lines using RNAzol B (Cinna/Biotecx). cDNA was made 
using 2.5 microgram of total RNA using the Superscript II kit (BRL) following the manufacturers instructions. A portion 
of the cDNA was used for specific PCR amplifications of fragments corresponding either to mRNA encoding the ER 
or to the novel estrogen receptor. (It should be emphasized that the primers used are based on human and rat se- 
quences, whereas some of the cell lines were not rat or human, see legend of Figure 4). Primers used were for ERa: 
sense 5'-GATGGGCTTACTGACCAACC-3' and antisense S'-AGATGCTCCATGCCTTTG-S' generating a 548 base pair 
fragment corresponding to part of the LBD. For ERp: sense 5'- TTCACCGAGGCCTCCATGATG-3' and antisense 5'- 
CAGATGTTCCATGCCCTTGTT-3' generating a 565 base pair fragment corresponding to part of the LBD. The PCR 
samples were analysed on agarose which were blotted onto Nylon membranes. These blots were hybridised with 32 P- 
labeled PCR fragments generated with the above-mentioned primers on ERa and ERp plasmid DNA using standard 
experimental procedures (Sam brook et a/, 1 989). 

£. Ligand-dependent transcription activation by the novel estrogen receptor protein. 

Cell culture 

Chinese Hamster Ovary (CHO K1 ) cells were obtained from ATCC (CCL61 ) and maintained at 37°C in a humidified 
atmosphere (5% C0 2 ) as a monolayer culture in fenolred-free M505 medium. The latter medium consists of a mixture 
(1:1) of Dulbecco's Modified Eagle's Medium (DMEM, Gibco 074-200) and Nutrient Medium F12 (Ham's F12, Gibco 
074-1700) supplemented with 2.5 mg/ml sodium carbonate (Baker), 55 |ig/ml sodium pyruvate (Fluka), 2.3 u.g/ml p- 
mercaptoethanoi (Baker), 1 .2 u.g/ml ethanolamine (Baker), 360 ^ig/ml L-glutamine (Merck), 0.45 u,g/ml sodium selenite 
(Fluka), 62.5 ng/ml pen icillin (Mycopharm), 62.5 \ig/m\ streptomycin (Serva), and 5% charcoal-treated bovine calf serum 
(Hyclone). 

Recombinant vectors 

The ERp-encoding sequence as presented in SEQ ID No. 1 was amplified by PCR using oligonucleotides 5'- 
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CTTGGATCCATAGCCCTGCTGTGATGAATTACAG-3' (SEQ ID NO:22 underlined is the translation initiation codon) 
in combination with 5'-GATGGATCCTCACCTCAGGGCCAGGCGTCACTG-3' (SEQ ID NO:23) (underlined is the trans- 
lation stopcodon, antisense). The resulting BamH1 fragment (approximately 1450 base pairs) were then cloned in the 
mammalian cell expression vector pNGV1 (Genbank accession No. X99274). 

s An expression construct encoding the ERp reading frame as presented in SEQ ID NO:24 was made by replacing 

aBamH1-Msc1 fragment (nucleotides 1-81 in SEQ ID No. 1) by a BamH1-Msc1 fragment corresponding to nucleotides 
77-316 in SEQ ID No. 24. The latter fragment was made by PCR with SEQ ID NO:26 in combination with SEQ ID NO: 
28 using the above mentioned 5' RACE fragment. 

The reporter vector was based on the rat oxytocin gene regulatory region (position -363/+16 as a Hindlll/ Mbol 

10 fragment; RJvell, and D.Richter, Proc. Natl. Acad. Sci. USA 81^ 2006-201 0, 1984) linked to the firefly luciferase encoding 
sequence; the regulatory region of the oxytocin gene was shown to possess functional estrogen hormone response 
elements in vitro for both the rat (R.Adan etal, Biochem.Biophys.Res.Comm. V75, 117-122, 1991) and the human (S. 
Richard, and KZingg, J.Biol.Chem. 265, 6098-6103, 1990). 

15 Transient transfection 

1 x 1 0 5 CHO cells were seeded in 6-wells Nunclon tissue culture plates and DN A was introduced by use of lipofectin 
(Gibco BRL). Hereto, the DNA (1 u.g of both receptor and reporter vector in 250 ul Optimem, Gibco BRL) was mixed 
with an equal volume of lipofectin reagent (7 uX in 250 uJ_ Optimem, Gibco) and allowed to stand at room temperature 

20 for 15 min. After washing the cells twice with serum-free medium (M505) new medium (500 jiL Optimem, Gibco) was 
added to the celts followed by the dropwise addition of the DNA-lipofectin mixture. After incubation for a 5 hour period 
at 37°C cells were washed twice with fenolred-free M505 + 5% charcoal -treated bovine calf serum and incubated 
overnight at 37°C. After 24 hours hormones were added to the medium (1 0" 7 mol/L). Cell extracts were made 48 hours 
posttransfection by the addition of 200 u,L lysisbuffer (0.1 M phosphate buffer pH7.8, 0.2% Triton X-100). After incu- 

25 bation for 5 min at 37°C the cell suspension was centrifuged (Eppendorf centrifuge, 5 min) and 20 ul sample was 
added to 50 uJ_ luciferase assay reagent (Promega). Light emission was measured in a luminometer (Berthold Biolumat) 
for 10 sec at 562 nm. 

Stable transfection of the novel estrogen receptor. 

30 

The expression plasmid encoding full-length ERp1-530 (see above) was stably transfected in CHO K1 cells as 
previously described (Theunissen et at, J. Biol. Chem. 268, 9035-9040, 1993). Single cell clones that were obtained 
this way were screened by transient transfection of the reporter plasmid (rat oxytocin-lucif erase) as described above. 
Selected clones were used for a second stable transfection of the rat oxytocin -luciferase reporter plasmid together 

35 with the plasmid pDR2A which contains a hygromycine resitance gene for selection. Single cell clones obtained were 
tested for a response to 17p-estradiol. Subsequently, a selected single cell clone was used for transactivation studies. 
Briefly, cells were seeded in 96-wells at (1.6X10 4 cells per well). After 24 hours different concentrations of hormone 
were diluted in medium and added to the wells. For antagonistic experiments, 2x10" 10 M. 17p-estradiol was added to 
each well and different concentrations of antagonists were added. Cells were washed once with PBS after a 24 hour 

40 incubation and then lysed by the addition of 40 microliter lysis buffer (see above). Luciferase reagent was added (50 
microliter) to each well and light emission was measured using the Topcount (Packard). 

Results. 
* 

45 A comparison of the two expression constructs (SEQ ID NO:1 and SEQ ID NO:24) in transient transfections in 

CHO cells showed identical transactivation in response to a number of agonists and antagonists. CHO cells transiently 
transfected with ERp expression vector and a reporter plasmid showed a 3 to 4 fold increase in luciferase activity in 
response to 17p-estradiol as compared to untreated cells (see Figure 2). A similar transactivation was obtained upon 
treatment with estriol and estrone. The results indicate not only that the novel ER (ERp) can bind estrogen hormones 

so but also that the ligand-activated receptor can bind to the estrogen-response elements (EREs) within the rat oxytocin 
promoter and activate transcription of the luciferase reporter gene. Figure 3 shows that in an independent similar 
experiment 10" 9 mol/L 17p-estradiol gave an 1 8-fold stimulation with ERaand a 7-fold stimulation with ERp. In addition, 
the antiestrogen 101-1 64384 was shown to be an antagonist for both ERcc and ERp when activated with 17p-estradiol, 
whereas the antagonist alone had no effect. In this experiment 0.25 u,g p-galactosidase vector was co-transfected in 

ss order to normalize for differences in transfection efficiency. 

Transactivation studies performed on stably transfected ERaand ERP cell lines gave similar absolute luciferase 
values. The curves for 1 7p-estradiol are very simitar and show that half-maximal transactivation is reached with lower 
concentrations of hormone on ERct as compared to ERp (Figure 5). For Org4094 this is also the case however, the 
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effect observed is much more pronounced. The curves for raloxifen show that the potency of this antagonist to block 
transactivation on ERa is greater compared to its potency to block ERp transactivation. 
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SEQUENCE LISTING 

5 

(1) GENERAL INFORMATION: 
(i) APPLICANT; 

10 

(A) NAME: Akzo nobel n.v. 

(B) STREET: Velperweg 76 

(C) CITY: Arnhem 

is (E) COUNTRY: The Netherlands 

<F) POSTAL CODE (ZIP) : 6824 BM 

(G) TELEPHONE: 0412-666379 

(H) TELEFAX: 04X2-650592 
20 (I) TELEX: 37503 akpha nl 

<ii) TITLE OF INVENTION: Novel estrogen receptor 

25 

(ill) NUMBER OF SEQUENCES: 28 

(lv) COMPUTER READABLE FORM: 
30 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

35 

(2) INFORMATION FOR SEQ ID NO: 1: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1434 base pairs 
4 (B) TYPE: nucleic acid 

45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

Ui> MOLECULE TYPE: cDNA 

so 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

55 
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ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60 

5 ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120 

CAGTTATCAC ATCTGT AT GC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180 

10 

GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 240 

GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300 

15 

GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 360 

2Q AAAA6AAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 420 

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480 

25 ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 540 

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600 

30 

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660 

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720 

35 

ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 780 

AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840 

40 

TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900 

45 CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960 

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 1020 

60 CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080 

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140 

55 
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ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200 
5 CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGTAA CAAGGGGATG 1260 

GAACATCTGC TCAACATGAA GTGCAAAAAT GTGGTCCCAG TGTATGACCT GCTGCTGGAG 1320 
10 ATGCTGAATG CCCACGTGCT TCGCGGGTGC AAGTCCTCCA TCACGGGGTC CGAGTGCAGC 1380 

CCGGCAGAGG ACAGTAAAAG CAAAGAGGGC TCCCAGAACC CACAGTCTCA GTGA 1434 

75 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: cDNA 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
35 ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60 

ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120 

40 

CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180 

4 GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 240 

45 

GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300 

GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGC C TTT TTT **360 

so 

AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 420 

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480 
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ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGAGAGAGA 540 

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600 

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660 

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720 

15 ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 760 

AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840 

20 TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900 

CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960 

CTGGAAAXCT TTGACATGCT CCTGGCAACT ACTTCAAGCT TTCGAGAGTT AAAACTCCAA 1020 

CACAAAGAAT ATCTCTGTGT GAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080 

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140 

ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200 
CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGGTG A 
40 (2) INFORKATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



30 



35 



45 



50 



55 



(ii) MOLECULE TYPE: peptide 



1251 



♦ 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp 
15 10 15 

Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser lie Gin Gly His 
20 25 30 

Asn Asp Tyr He Cys Pro Ala Thr Asn Gin Cys Thr He Asp Lys Asn 
35 40 45 

Arg Arg Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val 
50 55 €0 

Gly Met 
65 

INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Leu Val Leu Thr Leu Leu Glu Ala Glu Pro Pro His Val Leu He Ser 
15 10 15 

Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser Met Met Met Ser Leu Thr 
20 25 30 

Lys Leu Ala Asp Lys Glu Leu Val His Met He Ser Trp Ala Lys Lys 
35 40 45 
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10 



15 



20 



SO 



55 



He Pro Gly Phe Val Glu Leu Ser Leu Phe Asp Gin Val Arg Leu Leu 
50 55 60 

Glu Ser Cys Trp Met Glu Val Leu Met Met Gly Leu Met Trp Arg Ser 
65 70 75 80 

He Asp His Pro Gly Lys Leu He Phe Ala Pro Asp Leu Val Leu Asp 
85 90 95 

Arg Asp Glu Gly Lys Cys Val Glu Gly lie Leu Glu He Phe Asp Met 
100 105 110 

Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu Leu Lys Leu Gin His Lys 
115 120 125 



Glu Tyr Leu Cys Val Lys Ala Met He Leu Leu Asa Ser Ser Met Tyr 
25 130 135 140 

Pro Leu Val Thr Ala Thr Gin Asp Ala Asp Ser Ser Arg Lys Leu Ala 
145 150 155 160 

30 

His Leu Leu Aan Ala Val Thr Asp Ala Leu Val Trp Val He Ala Lys 

165 170 175 

35 

Ser Gly He Ser Ser Gin Gin Gin Ser Met Arg Leu Ala Asn Leu Leu 
160 185 190 

40 

Met Leu Leu Ser His Val Arg His Ala Ser Asn Lys Gly Met Glu His 
195 200 205 

45 Leu Leu Asn Met Lys Cys Lys Asn Val Val Pro Val Tyr Asp Leu Leu 

210 215 220 



Leu Glu Met Leu Asn Ala His Val Leu 
225 230 

(2) INFORMATION FOR SEQ ID NO: 5; 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDKESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Asn Tyr Ser lie Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro 
15 10 15 

Gly Arg Gin Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His 
20 25 30 

Leu Ser Pro Leu Val Val His Arg Gin Leu Ser His Leu Tyr Ala Glu 
35 40 45 

Pro Gin Lys Ser Pro Trp Cys Glu Ala Arg Ser Leu Glu His Thr Leu 

50 55 60 

Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cys 

65 70 75 80 

Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys 
85 90 95 

Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser 
100 105 110 

Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser lie Gin Gly His Asn 
115 120 125 

Asp Tyr lie Cys Pro Ala Thr Asn Gin Cys Thr He Asp Lys Asn Arg 
130 135 140 
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Arg Lys Ser Cys Gin Ala Cys Arg Leu Arg Lya Cys Tyr Glu Val Gly 
145 150 155 160 

Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val 
165 170 175 

Arg Arg Gin Arg Ser Ala Asp Glu Gin Leu His Cys Ala Gly Lys Ala 
180 165 190 

Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp 
195 200 205 

Ala Leu Ser Pro Glu Gin Leu Val Leu Thr Leu Leu Glu Ala Glu Pro 
210 215 220 

Pro His Val Leu lie Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser 
225 230 235 240 

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met 
245 250 255 

He Ser Trp Ala Lys Lys lie Pro Gly Phe Val Glu Leu Ser Leu Phe 
260 265 270 

Asp Gin Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met 
275 280 285 

Gly Leu Met Trp Arg Ser He Asp His Pro Gly Lys Leu He Phe Ala 
290 295 300 

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly He 
305 310 315 320 

Leu Glu He Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu 
325 330 335 

Leu Lys Leu Gin His Lys Glu Tyr Leu Cys Val Lys Ala Met He Leu 
340 345 350 
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Leu Asn Ser Ser Met Tyr Pro Leu 
355 360 

Ser Ser Arg Lys Leu Ala His Leu 
370 375 

Val Trp Val He Ala Lys Ser Gly 
385 390 

Arg Leu Ala Asn Leu Leu Net Leu 
405 

Asn Lys Gly Met Glu His Leu Leu 
420 

Pro Val Tyr Asp Leu Leu Leu Glu 
435 440 

Gly Cys Lys Ser Ser He Thr Gly 
450 455 



Ser Lys Ser Lys Glu Gly Ser Gin Asn Pro Gin Ser Gin 
465 470 475 



(2) INFORMATION FOR SfiQ ID NO: 6: 



Val Thr Ala Thr Gin Asp Ala Asp 
365 

Leu Asn Ala Val Thr Asp Ala Leu 
380 

He Ser Ser Gin Gin Gin Ser Met 
395 400 

Leu Ser His Val Arg His Ala Ser 
410 415 

Asn Met Lys Cys Lys Asn Val Val 
425 430 

Met Leu Asn Ala His Val Leu Arg 
445 

Ser Glu Cys Ser Pro Ala Glu Asp 
460 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asn Tyr Ser He Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro 

20 
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10 



15 



Gly Arg Gin Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His 
20 25 30 



10 



Leu Ser Pro Leu Val Val His Arg Gin Leu Ser His Leu Tyr Ala Glu 
35 40 45 



15 



Pro Gin Lya Ser Pro Trp Cya Glu Ala Arg Ser Leu Glu Hia Thr Leu 
50 55 60 



20 



25 



Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cya 
65 70 75 80 

Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys 
85 90 95 

Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser 
100 105 no 



30 



Cya Glu Gly Cys Lys Ala Phe Phe Lya Arg Ser lie Gin Gly His Asn 
115 120 125 



35 



Asp Tyr lie Cya Pro Ala Thr Asn Gin Cys Thr He Asp Lys Asn Arg 

130 135 140 



40 



Arg Lys Ser Cys Gin Ala Cya Arg Leu Arg Lys Cys Tyr Glu Val Gly 
145 150 155 160 



45 



Het Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val 
165 170 175 

Arg Arg Gin Arg Ser Ala Asp Glu Gin Leu His Cys Ala Gly Lys Ala 
180 185 190 



50 



Lya Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp 
195 200 205 



55 



Ala Leu Ser Pro Glu Gin Leu Val Leu Thr Leu Leu Glu Ala Glu Pro 
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210 



215 



220 



Pro His Val Leu lie Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser 
225 230 235 240 

Met Met Met Ser Leu Thr Lys Leu Ala Aap Lya Glu Leu Val Hia Met 
245 250 255 

lie Ser Trp Ala Lya Lya lie Pro Gly Phe Val Glu Leu Ser Leu Phe 
260 265 270 

Aap Gin Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met 
275 280 285 

Gly Leu Met Trp Arg Ser lie Asp His Pro Gly Lys Leu He Phe Ala 
290 295 300 

Pro Aap Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly He 

305 310 315 320 

Leu Glu He Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu 
325 330 335 

Leu Lys Leu Gin His Lys Glu Tyr Leu Cys Val Lys Ala Met He Leu 
340 345 350 

Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gin Asp Ala Asp 

355 360 365 

Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Aap Ala Leu 

370 375 380 

Val Trp Val He Ala Lys Ser Gly He Ser Ser Gin Gin Gin Ser Met 
385 390 395 400 



Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Arg 
405 410 415 
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(2) INFORMATION FOR SBQ ID NO: 7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t both 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGIGAYGARG CWTCIGGITG YCAYTAYGG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NOt 6: 
AAGCCTGGSA YICKYTTIGC CCAIYTIAT 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TGTTACGAAG TGGGAATGGT GA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTGACACCAG ACCAACTGGT AATG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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GGTGGCGACG ACTCCTGGAG CCCG 
(2) INFORMATION FOR SBQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GTACACTGAT TTGTAGCTGG AC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCATGATGAT GTCCCTGACC 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
TCGCATGCCT GACGTGGGAC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GGCSTCCAGC ATCTCCAGSA RCAG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 16: 
GGAAGCTGGC TCACTTGCTG 
(2) INFORMATION FOR SEQ ID NO: 17: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TCTTGTTCTG GACAGGGATG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCATGGAACA TCTGCTCAAC 
(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base paxes 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDHA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGCAAGTTCA GCCTGTTAAG T 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 
ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 
CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 
GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 
GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 
GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 
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AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 420 

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480 

ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 540 

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600 

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660 

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720 

ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 780 

AAGAAGATTC CCGGCTTTGT GGAGCTGAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840 

TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900 

CTCATCTTTG CTCCAGATCT TGTTCTGGAC XGGGATGAGG GGAAATGCGT AGAAGGAATT 960 

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 1020 

CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080 

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140 

ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200 

CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGGTC TGCCTGA 1257 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 amino acids 

so .(B) TYPE: amino acid * 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



75 



20 



25 



30 



35 



40 



45 



55 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Asn Tyr Ser He Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro 
1 5 10 15 

Gly Arg Gin Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His 

20 25 30 , - 

Leu Ser Pro Leu Val Val Hia Arg Gin Leu Ser Hia Leu Tyr Ala Glu 
35 40 45 

Pro Gin Lya Ser Pro Trp Cya Glu Ala Arg ser Leu Glu Hia Thr Leu 
50 55 60 

Pro Val Aan Arg Glu Thr Lau Lya Arg Lys Val Ser Gly Aan Arg Cya 

65 70 75 eo 

Ala ser Pro Val Thr Gly Pro Gly Ser Lya Arg Aap Ala Hia Phe Cya 
85 90 95 

Ala Val Cya Ser Aap Tyr Ala Ser Gly Tyr Hia Tyr Gly Val Trp Ser 
100 105 no 

Cya Glu Gly cya Lys Ala Phe Phe Lys Arg Ser He Gin Gly His Asn 
115 120 125 

Aap Tyr He Cys Pro Ala Thr Asn Gin Cya Thr He Aap Lya Aan Arg 
130 135 140 



Gly 



Arg Lya Ser Cys Gin Ala Cya Arg Leu Arg Lya Cys Tyr Glu Val 
145 »0 155 160 

Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val 
165 170 175 

Arg Arg Gin Arg Ser Ala Asp Glu Gin Leu His Cys Ala Gly Lya Ala 
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180 



185 



190 



Lys Arg Ser Gly Gly Hi a Ala Pro Arg Val Arg Glu Leu Leu Leu Asp 

195 200 205 

Ala Leu Ser Pro Glu Gin Leu Val Leu Thr Leu Leu Glu Ala Glu Pro 
210 215 220 

Pro His Val Leu lie Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser 
225 230 235 240 

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met 
245 250 255 

He Ser Trp Ala Lys Lys He Pro Gly Phe Val Glu Leu Ser Leu Phe 

260 265 270 

Asp fila Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met 

275 280 285 

Gly Leu Met Trp Arg Ser He Asp His Pro Gly Lys Leu He Phe Ala 

290 295 300 

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly He 

305 310 315 3 2 o 



Leu Glu He Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu 

325 330 335 

Leu Lys Leu Gin His Lys Glu Tyr Leu Cys Val Lys Ala Met He Leu 
340 345 350 

Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gin Asp Ala Asp 
355 360 365 

Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Asp Ala Leu 

3 ™ 375 380 



Val Trp Val He Ala Lys Ser Gly He Ser Ser Gin Gin Gin Ser Met 
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385 390 395 400 

Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Arg 
405 410 415 

Ser Ala 



(2) INFORMATION FOR SBQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



30 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 22: 

CTTGGATCCA TAGCCCTGCT GTGATGAATT ACAG 34 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GATGGATCCT CACCTCAGGG CCAGGCGTCA CTG 33 
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(2) INFORMATION FOR SEQ ID NO: 24: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1698 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

15 



20 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CACGAATCTT TGAGAACATT ATAATGACCT TTGTGCCTCT TCTTGCAAGG TGTTTTCTCA 60 

25 

GCTGTTATCT CAAGACATGG ATATAAAAAA CTCACCATCT AGCCTTAATT CTCCTTCCTC 120 

30 CTACAACTGC AGTCAATCCA TCTTACCCCT GGAGCACGGC TCCATATACA TACCTTCCTC 180 

CTATGTAGAC AGCCACCATG AATATCCAGC CATGACATTC TATAGCCCTG CTGTGATGAA 240 

35 TTACAGCATT CCCAGCAATG TCACTAACTT GGAAGGTGGG CCTGGTCGGC AGACCACAAG 300 

CCCAAATGTG TTGTGGCCAA CACCTGGGCA CCTTTCTCCT TTAGTGGTCC ATCGCCAGTT 360 

40 

ATCACATCTG TATGCGGAAC CTCAAAAGAG TCCCTGGTGT GAAGCAAGAT CGCTAGAACA 420 

CACCTTACCT GTAAACAGAG AGACACTGAA AAGGAAGGTT AGTGGGAACC GTTGCGCCAG 480 

45 

CCCTGTTACT GGTCCAGGTT CAAAGAGGGA TGCTCACTTC TGCGCTGTCT GCAGCGATTA 540 

CGCATCGGGA TATCACTATG GAGTCTGGTC GTGTGAAGGA TGTAAGGCCT TTTTTAAAAG *600 

50 

AAGCATTCAA GGACATAATG ATTATATTTG TCCAGCTACA AATCAGTGTA CAATCGATAA 660 

55 AAACCGGCGC AAGAGCTGCC AGGCCTGCCG ACTTCGGAAG TGTTACGAAG TGGGAATGGT 720 
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GAAGTGTGGC TCCCGGAGAG AGAGATGTGG GTACCGCCTT GTGCGGAGAC AGAGAAGTGC 780 

5 

CGACGAGCAG CTGCACTGTG CCGGCAAGGC CAAGAGAAGT GGCGGCCACG CGCCCCGAGT 840 

GCGGGAGCTG CTGCTGGACG CCCTGAGCCC CGAGCAGCTA GTGCTCACCC TCCTGGAGGC 900 

10 

TGAGCCGCCC CATGTGCTGA TCAGCCGCCC CAGTGCGCCC TTCACCGAGG CCTCCATGAT 960 

is GATGTCCCTG ACCAAGTTGG CCGACAAGGA GTTGGTACAC ATGATCAGCT GGGCCAAGAA 1020 

GATTCCCGGC TTTGTGGAGC TCAGCCTGTT CGACCAAGTG CGGCTCTTGG AGAGCTGTTG 1080 

20 GATGGAGGTG TTAATGATGG GGCTGATGTG GCGCTCAATT GACCACCCCG GCAAGCTCAT 1140 

CTTTGCTCCA GATCTTGTTC TGGACAGGGA TGAGGGGAAA TGCGTAGAAG GAATTCTGGA 1200 

25 AATCTTTGAC ATGCTCCTGG CAACTACTTC AMGTTTCGA GAOTTAAMC TCCAACACAA 1260 

AGAATATCTC TGTGTCAAGG CCATGATCCT GCTCAATTCC AGTATGTACC CTCTGGTCAC 1320 

30 

AGCGACCCAG GATGCTGACA GCAGCCGGAA GCTGGCTCAC TTGCTGAACG CCGTGACCGA 1380 

TGCTTTGGTT TGGGTGATTG CCAAGAGCGG CATCTCCTCC CAGCAGCAAT CCATGCGCCT 1440 

35 

GGCTAACCTC CTGATGCTCC TGTCCCACGT CAGGCATGCG AGTAACAAGG GGATGGAAGA 1500 

TCTGCTCAAC ATGAAGTGCA AAAATGTGGT CCCAGTGTAT GACCTGCTGC TGGAGATGCT 1560 

40 

GAATGCCCAC GTGCTTCGCG GGTGCAAGTC CTCCATCACG GGGTCCGAGT GCAGCCCGGC 1620 

4 

45 AGAGGACAGT AAAAGCAAAG AGGGCTCCCA GAACCCACAG TCTCAGTGAC GCCTGGCCCT 1680 

GAGGTGAACT GGCCCACAGA GGTCACAAGC TGAAGCGTGA ACTCCAGTGT GTCAGGAGCC 1740 

% 

50 TGGGCTTCAT CTTTCTGCTG TGTGGTCCCT CATTTGGTGA TGGCAGGCTT GGTCATGTAC 1800 

CATCCTTCCC TCCACCTTCC CAACTCTCAG GAGTCGGTGT GAGGAAGCCA TAGTTTCCCT I860 

55 
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TGTTAGCAGA GGGACATTTG AATCGAGCGT TTCCACAC 1898 
<2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 530 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNE5S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Asp He Lys Asn Ser Pro Ser Ser Leu Asn Ser Pro Ser Ser Tyr 

15 10 15 

Asn Cys Ser Gin Ser He Leu Pro Leu Glu His Gly Ser He Tyr He 
20 25 30 

Pro Ser Ser Tyr Val Asp Ser His His Glu Tyr Pro Ala Met Thr Phe 
35 40 45 

Tyr Ser Pro Ala Val Met Asn Tyr Ser He Pro Ser Asn Val Thr Asn 
50 55 60 

Leu Glu Gly Gly Pro Gly Arg Gin Thr Thr Ser Pro Asn Val Leu Trp 
65 70 75 80 

Pro Thr Pro Gly His Leu Ser Pro Leu Val Val His Arg Gin Leu Ser 
85 90 95 

His Leu Tyr Ala Glu Pro Gin Lys Ser Pro Trp Cys Glu Ala Arg Ser 
100 105 110 

Leu Glu His Thr Leu Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val 
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115 



120 



125 



Ser Gly Asn Arg Cys Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg 
130 135 140 



w 



Asp Ala His Phe Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His 
145 150 155 160 



15 



Tyr Gly Val Trp Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser 
165 170 175 

lie Gin Gly His Asn Asp Tyr He Cys Pro Ala Thr Asn Gin Cys Thr 
180 185 190 



20 



He Asp Lys Asn Arg Arg Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys 
195 200 205 



25 



Cys Tyr Glu Val Gly Met Val Lya Cys Gly Ser Arg Arg Glu Arg Cys 
210 215 220 



30 



35 



Gly Tyr Arg Leu Val Arg Arg Gin Arg Ser Ala Asp Glu Gin Leu His 
225 230 235 240 

Cys Ala Gly Lya Ala Lya Arg Ser Gly Gly His Ala Pro Arg Val Arg 
245 250 255 

Glu Leu Leu Leu Asp Ala Leu Ser Pro Glu Gin Leu Val Leu Thr Leu 
260 265 270 



40 



Leu Glu Ala Glu Pro Pro His Val Leu He Ser Arg Pro Ser Ala Pro 
275 280 285 



45 



Phe Thr Glu Ala Ser Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys 
290 295 300 



50 



Glu Leu Val His Met He Ser Trp Ala Lys Lys He Pro Gly Phe Val 
305 310 315 320 

Glu Leu Ser Leu Phe Asp Gin Val Arg Leu Leu Glu Ser Cys Trp Met 



55 
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325 



330 



335 



20 



Glu Val Leu Met Met Gly Leu Met Trp Arg Ser He Asp His Pro Gly 
340 345 350 

Lys Leu He Phe Ala Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lya 
355 360 365 

Cys Val Glu Gly He Leu Glu He Phe Asp Met Leu Leu Ala Thr Thr 
370 375 380 

Ser Arg Phe Arg Glu Leu Lys Leu Gin His Lys Glu Tyr Leu Cys Val 
385 390 395 400 

Lys Ala Met He Leu Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala 
405 410 415 



25 



Thr Gin Asp Ala Asp Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala 
420 425 430 



30 



Val Thr Asp Ala Leu Val Trp Val He Ala Lys Ser Gly He Ser Ser 
435 440 445 



35 



Gin Gin Gin Ser Met Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His 
450 455 460 



Val Arg His Ala Ser Asn Lys Gly Met Glu His Leu Leu Asn Met Lys 
465 470 475 480 

Cys Lys Asn Val Val Pro Val Tyr Asp Leu Leu Leu Glu Met Leu Asn 
485 490 495 



45 



Ala His Val Leu Arg Gly Cys Lys Ser Ser He Thr Gly Ser Glu Cys 
500 505 510 



SO 



Ser Pro Ala Glu Asp Ser Lys Ser Lys Glu Gly Ser Gin Asn Pro Gin 
515 520 525 



55 



Ser Gin 
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530 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTGCGGATCC TCTCAAGACA TGGATATAAA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGTAACAGGG CTGGCGCAAC GGTTC 
(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) ST HANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ACTGGCGATG GACCACTAAA GG 22 



25 Claims 

1 . Isolated DN A encoding a protein having an N-terminal domain, a DNA-binding domain and a ligand-binding domain, 
wherein the amino acid sequence of said DNA-binding domain of said protein exhibits at least 80% homology with 
the amino acid sequence shown in SEQ ID NO:3, and the amino acid sequence of said ligand-binding domain of 

30 said protein exhibits at least 70% homology with the amino acid sequence shown in SEQ ID NO:4. 

2. Isolated DNA according to claims 1 , characterized in that the amino acid sequence of said DNA-binding domain 
of said protein exhibits at least 90%, preferably 95%, more preferably 98%, most preferably 100% homology with 
the amino acid sequence shown in SEQ ID NO: 3. 

35 

3. Isolated DNA according to claims 1 or 2, characterized in that the amino acid sequence of said ligand-binding 
domain of said protein exhibits at least 75%, preferably 80%, more preferably 90%, most preferably 1 00% homology 
with the amino acid sequence shown in SEQ ID NO:4. 

40 4. Isolated DNA according to claims 1 to 3, said DNA encoding a protein comprising the amino acid sequence of 
SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:21 or SEQ ID NO:25. 

5. Isolated DNA according to claims 1 to 4, characterized in that said DNA comprises the nucleic acid sequence of 
SEQ ID NO:1 , SEQ ID NO:2, SEQ ID NO:20 or SEQ ID NO:24. 

6. A recombinant expression vector comprising the DNA according to any of the claims 1 to 5. 

7. A cell transfected with DNA according to claims 1 to 5 or an expression vector according to claim 6. 

so 8. A cell according to claim 7 which is a stable transfected cell line which expresses the steroid receptor protein 
according to any of the claims 9 to 11 . 

9. Protein encoded by DNA according to claims 1 to 5 or an expression vector according to claim 6. 

ss 10. Protein according to claim 9, said protein comprising the amino acid sequence of SEQ ID NO:5, SEQ ID NO:6, 
SEQ ID NO:21 or SEQ ID NO:25. 

11. Chimeric protein having an N-terminal domain, a DNA-binding domain, and a ligand-binding domain, 
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characterized in that at least one of said domains of said chimeric protein originates from a protein according 
to claims 9 or 10, and at least one of the other domains of said chimeric protein originates from another receptor 
protein from the nuclear receptor superfamily, provided that the DN A-binding domain and the ligand-binding domain 
of said chimeric protein originates from different proteins. 

12. DNA encoding a protein according to claim 11 . 

13. Use of a DNA according to claims 1 to 5 or 12, an expression vector according to claim 6, a cell according to claim 
7 or 8 or a protein according to claim 9 to 11 in a screening assay for identification of new drugs. 

14. A method for identifying functional ligands for the protein according to claims 9 to 11 , said method comprising the 
steps of 

a) introducing into a suitable host cell 1 ) DNA according to claims 1 to 5 or 1 2, and 2) a suitable reporter gene 
functionally linked to an operative hormone response element, said HRE being able to be activated by the 
DNA-binding domain of the protein encoded by said DNA; 

b) bringing the host cell from step a) into contact with potential ligands which will possibly bind to the ligand- 
binding domain of the protein encoded by said DNA from step a); 

c) monitoring the expression of the protein encoded by said reporter gene of step a). 
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Transient transfection of CHO cells with Estrogen Receptors Alpha and Beta 
Incubation with Estradiol and ICI 
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